APPARATUS AND METHOD FOR 
ACCESSING A BRANCH TARGET BUFFER 

The present invention relates to computer systems and to 
branch target buffers for use in the supply of instructions to 
a processor of a computer system. 

BACKGROUND OF THE INVENTION 

To run a computer program, a computer repetitively 
carries out a sequence of functions, typically fetching an 
instruction held at a given address, decoding the instruction, 
accessing an operand for use by the instruction, executing 
the instruction, storing the result of the execution and 
determining the next instruction address. A problem will 
occur where an instruction contains a test whose result 
determines the address of the next instruction to be executed. 
An instruction of this type is known as a conditional jump. 

The consequence of the presence of a conditional jump 
instruction is that the instruction typically has to pass 
through several pipeline stages before the test is resolved, 
and before the next instruction to be fetched can be deter- 
mined with certainty. This can delay the pipeline process. 

A program sequence may also include non-conditional 
jump instructions. Such instructions, if executed, result in 
the program sequence jumping to a new instruction. 

For the purposes of the present description and claims, the 
term "branch instruction" will be used to include both 
conditional and non-conditional jump instructions. 

In this specification the term instruction includes primi- 
tive operations which may be included in a VLIW system 
using Very Long Instruction Words. An instruction word or 
sequence may therefore comprise a VLIW instruction. 

SUMMARY OF THE INVENTION 

The invention provides a method of writing a branch 
instruction into a branch target buffer which comprises a 
CAM array for holding fetch address data for a plurality of 
branch instructions and a RAM array for holding target 
address data for each of said branch instructions, said branch 
instructions being stored in a computer system at address- 
able locations having a fetch address and each said branch 
instruction providing a target address, said method compris- 
ing comparing first selected bits of the fetch address of the 
branch instruction to be written, against first selected bits of 
the target address of said branch instruction, and in response 
to a match between said bits, writing second selected bits of 
the fetch address of said instruction to a CAM location in 
said CAM array, whereby said second selected bits of the 
fetch address constitute said fetch address data indicative of 
the fetch address of said branch instruction, and writing 
second selected bits of the target address to a corresponding 
RAM location in said RAM array, whereby said second 
selected bits of the target address constitute said target 
address data indicative of the target address of said branch 
instruction, wherein said first selected bits and said second 
selected bits represent parts of different significance in the 
fetch address. 

A write operation into said branch target buffer is not 
completed if on comparing said first selected bits of the fetch 
address and the target address no match is found. 

Preferably said first selected bits of the fetch address and 
said first selected bits of the target address represent the bits 
of higher significance of said address and said second 
selected bits of the fetch address and the second selected bits 
of the target address represent the least significant bits of the 
address. 



The invention also provides a method of operating a 
partitioned cache memory as a branch target buffer in which 
the CAM array of each partition is arranged to hold only 
selected bits of a fetch address of each branch instruction 

5 and the RAM array of each partition is arranged to hold only 
selected bits of the target address of branch instructions in 
the buffer, wherein said selected bits of both the fetch 
address of a branch instruction and of the target address for 
a branch instruction relate to the least significant bits of the 

10 respective address. 

This enables a significant reduction in the storage capacity 
and overall size necessary for the CAM array and data 
RAMs. 

Preferably a read operation of said branch target buffer is 

15 effected by providing two input signals relating to fetch 
address data, a first input comprises some of said selected 
bits of a fetch address to test for a cache hit in any cache 
partition responding to said first input, and a second input 
corresponding to the remainder of said selected bits of the 

20 fetch address, said second input controlling masking cir- 
cuitry to prevent a cache hit output from any partition not 
corresponding to data in said second input. 
The invention includes a branch target buffer comprising 

25 a partitioned cache memory, each partition comprising a 
CAM array for holding selected bits of the fetch address of 
a plurality of branch instructions held in the buffer, a RAM 
array in each partition coupled to said CAM array and 
arranged to hold selected bits of a fetch address of branch 

30 instructions held in the buffer, said selected bits of the fetch 
address and selected bits of said target address each com- 
prising the least significant bits of the respective address, 
and comparator circuitry for comparing the most significant 
bits of a fetch address and the most significant bits of the 

35 corresponding target address to control entry of a branch 
instruction into the buffer in dependence on a match between 
said most significant bits being found by said comparison 
circuitry. 

BRIEF DESCRIPTION OF THE DRAWINGS 

40 

FIG. 1 shows a simplified example of a series of instruc- 
tions; 

FIG. 2 shows a partial schematic diagram of circuitry for 
implementing the method of this invention; 
45 FIG. 3 shows a partial block diagram of an embodiment 
of a branch target buffer for implementing the invention; 

FIG. 4 shows an illustrative circuit diagram of read 
circuitry of a partition of the branch target buffer of FIG. 3; 
50 FIG. 5 shows the content of CAM cells and data RAMs 
of the branch target buffer of FIG. 3 when handling the 
instructions of FIG. 1; 

FIG. 6 shows a first example of updating the branch 
predicted values for an instruction; 
55 FIG. 7 shows a second example of updating branch 
prediction values for an instruction; 

FIG. 8 shows write circuitry for one line of a partition of 
the branch target buffer of FIG. 3 connected to part of a 
6Q computer system, and 

FIG. 9 shows a computer system including the branch 
target buffer of FIG. 3. 

DESCRIPTION OF THE PREFERRED 
EMBODIMENT 

65 

FIG. 1 shows a series of twenty one-byte instructions 
0-19 in a computer program. As seen in the figure, each 
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instruction is represented by an instruction number (0-19) 
and by an address indicative of a storage location at which 
the instruction is stored and which is represented by the 
binary equivalent of the instruction number. The instructions 
are illustratively grouped into sequences of four instructions 
101, 102, 103, 104, 105. Each of these groups is termed an 
instruction word. Each instruction word has a single word 
address, that of instruction word 101 being "000", that of 
instruction word 102 being "001", and so on. Each instruc- 
tion has a byte address indicating its position within the 
word, so that the first instruction of each word has byte 
address "00", the second "01" and so on. The majority of the 
instructions, specifically instructions 0-4, 6, 8, 10, 11, 13 
and 15-19 lead directly onto the next sequentialinstruction; 
these instructions are marked f X\ Instruction- 12 (marked 
'j') is termed an unconditional jump, because if execution 
reaches instruction 12 the result is to jump to instruction 16. 
Instructions 5, 7, 9 and 14 (marked 'cj') are each termed 
conditional jumps. The next instruction to be executed after 
a conditional jump typically depends upon a condition 
tested, and may for instance be determined by whether the 
value of the operand, stored as a result of the instruction 
which immediately precedes the conditional jump in time, 
exceeds a given value. The instruction to which the jump 
will occur if the condition is met is indicated in the figure. 
Thus instruction 5 will either be succeeded in time by 
instruction 6, or by instruction 14 depending on the value 
returned by instruction 4. Instruction 7 will be succeeded by 
instruction 8 or by instruction 0 depending upon the value 
returned by instruction 6. Instruction 9 will be succeeded by 
instruction 13 or by instruction 10 depending upon the value 
returned by instruction 8. Instruction 14 will be succeeded 
either by instruction 15, or by instruction 2 depending upon 
the value returned by instruction 13. 

In some computers, the instructions will be fetched from 
store individually, with instruction 0 fetched first, and then 
instruction 1 fetched, and so on. However, in certain 
computers, it has been proposed to associate plural instruc- 
tions together in sequence called instruction words, so that 
fetching from an addressable location makes available the 
plural instructions of the word for execution. In the present 
description, the term instruction word is not intended to be 
limitative and is used for convenience to refer to a plurality 
of instructions, or one VLTW instruction, which may be 
fetched by accessing a single address in store. 

In the example of instruction words shown in FIG. 1, the 
first instruction word 101 contains no branch instructions, ie. 
no conditional jumps and no unconditional jumps. The 
second instruction word 102 contains two conditional jump 
instructions, instructions 5 and 7. The third instruction word 
103 contains one conditional jump instruction, instruction 9. 
The fourth instruction word 104 contains one unconditional 
jump instruction, instruction 12 and one conditional jump 
instruction, instruction 14. The fifth instruction word con- 
tains no jump or conditional jump instructions. 

It will be appreciated that the program flow due to 
executing the first instruction word 101 will always be the 
same, namely executing the first instruction 4 of second 
instruction word 102. Executing the second instruction word 
102 causes three possible program flows, namely executing 
the first instruction 8 of third instruction word 103 (if no 
conditional jump is effected), executing the second instruc- 
tion 13 of the fourth instruction word 104 (if the conditional 
jump at instruction 5 is effected) or executing the first 
instruction 0 of first instruction word 101 (if the conditional 
jump at instruction 7 is effected). Similarly, the outcome of 
executing the third instruction word 103 is either to execute 
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the first instruction 12 of fourth instruction word 104 (if the 
conditional jump at instruction 9 is not effected) or executing 
the second instruction 13 of fourth instruction word 104 (if 
conditional jump instruction 9 is effected). Finally, the 
outcome of executing fourth instruction word 104 is either 
executing instruction 16, which occurs if the first instruction 
12 of instruction word 104 has been executed or if the 
conditional jump instruction 14 is not effected, or the 
alternative outcome is a return to instruction 2 of first 
instruction word 101. 

For optimum speed of operation, the computer, having 
fetched a given instruction word for execution, should next 
fetch the correct instruction word, in the sense of the 
instruction word containing the next instruction which is 
required to be executed. However, it will be seen from the 
foregoing that the identity of the next instruction to be 
executed may vary if the current instruction word contains 
any branch instruction. Take for example fourth instruction 
word 104: 

Instruction word 104 may be fetched either in response to 
executing instruction 9 or instruction 11 of third instruction 
word 103 or to executing instruction 5 the second instruction 
word 102. The next correct instruction word to fetch after 
word 104 is either first instruction word 101 or fifth instruc- 
tion word 105. By examining fourth instruction word 104, 
the reader will note that if instruction 11 of third instruction 
word 103 were executed, then the first instruction 12 of 
fourth instruction word 104 would be executed next, and the 
result of that would be that fifth instruction word 105 is the 
next word to fetch. If however fourth instruction word 104 
were fetched as a result of the conditional jump instruction 
9 or the conditional jump instruction 5, the identity of the 
next correct instruction word to fetch depends upon the 
outcome of the conditional jump in instruction 14. 

Continuing to consider the fourth instruction word 104, 
fetching a wrong word (for example, first instruction word 
101 being fetched instead of fifth instruction word 105) 
would result in the processing pipeline containing a number 
of wrong instructions which would of course not be 
executed. The practical consequence of this would be that a 
number of cycles would be lost, during which no useful 
execution took place; only on a subsequent correct cycle 
would the correct fifth instruction word 105 be executed, 
after being called up into the pipeline. 

A simplified device illustrating some of the features of the 
present invention will now be described with respect to FIG. 
2. 

Referring to FIG. 2, the device comprises memory cir- 
cuitry 200 which consists of a memory array 205 having a 
plurality of addressable locations and an address decoder 
210 for addressing the memory array 205. The address 
decoder 210 has an input 201 over which it receives instruc- 
tion addresses within a single instruction word. The address 
decoder produces address outputs corresponding to the input 
instruction address and all later instruction addresses up to 
the end of the relevant instruction word. The number n of 
storage locations in the memory array 205 is at least equal 
to the number of instructions in the program sequence 
currently being run. The memory locations of the array 205 
store at the address of respective instructions, information 
showing whether the instruction is a branch instruction or 
not, i.e. a logical 0 where the instruction is not a branch 
instruction, and a logical 1 if the instruction is a branch 
instruction. Each memory location has a respective output 
line 220 1 -220„. The output lines 220J-220,, are coupled to 
a register arrangement 250, having storage locations corre- 
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sponding to each of the lines 220 1 -220„, for storage of 
branch prediction values. Each register location has a 
respective input formed by one of the lines 220 a -220 w and 
a respective output line 2i80 1 -280 n . The register locations 
store logical 1 where an associated branch instruction is 5 
predicted as taken, and a logical 0 where an associated 
branch instruction is predicted as not taken. When an 
instruction address is input over input lines 201, the address 
decoder 210 addresses the corresponding locations of the 
array 205 and, where an instruction at a corresponding 30 
address is a branch instruction, there will be a logical 1 
output on the corresponding one of the output lines 
220 : -220 n which in turn accesses the register 250 and 
produces on an output line 280 either a corresponding 
logical 1 if the branch is predicted as taken, or a correspond- 15 
ing logical 0 if the branch is predicted as not taken. Where 
one or more of the addressed locations is not a branch 
instruction, logical zeros will be output over the correspond- 
ing output lines 220 1 -220„. 

The logic stage stored in register location corresponding 2 o 
to non-branch instructions is not significant because no 
logical 1 can occur on a line 220i-220 n unless the associated 
instruction is a branch instruction. As a result the output for 
each location which corresponds to a non-branch instruction 
will always be logical 0. The output lines 280 1 -280„ of the 2 s 
register 250 form word lines to a store 300 which stores 
target addresses of branch instructions. The store 300 has a 
first address output 301 at which the store 300 delivers the 
target address of a branch instruction, and a second output 
302 which provides a logical 1 when any branch instruction 30 
is predicted as taken. The store 300 has one row 300 1 -300 n 
for each word line 280 1 -280 n , and each row contains 
memory cells connected to the output lines 301 so that 
application of a logical 1 to one of the word lines 280 1 -280 n 
produces an address on output lines 301. Store circuitry 300 35 
also contains gating circuitry having an output to the second 
output line 302 and producing a logical 1 at output line 302 
when a branch is predicted as taken. The store circuitry 300 
further contains decision circuitry which provides "only the 
target address of the first jump instruction from the word 4 q 
which is predicted as taken. 

Before operating the device of FIG. 2, it is initialised by 
sequentially addressing the memory array 205 and storing a 
logical 1 at the address locations of memory array 205 which 
correspond to branch instructions. The register circuitry 250 45 
is loaded, during the initialising stage, with prediction 
information indicating whether or not the associated branch 
instructions are predicted taken. The store circuitry 300 is 
loaded with address information corresponding to the target 
addresses of the branch instructions stored in memory array 50 
205. The circuitry necessary for effecting the initialisation of 
the circuitry of FIG. 2 is not shown. 

Once the initialisation has been effected, the program 
sequence can be run. To do this, the address of the first 
instruction, instruction 0 of first instruction word 101 is fed 55 
to input 201 to memory circuitry 200. This causes address- 
ing of the memory array 205 at all locations corresponding 
to instructions in the first word. Because the first instruction 
word 101 contains no jump instructions, no logical 1 will be 
output. Hence the output lines 220j-220„ will all carry 60 
logical zero, and these logical zeros are applied to the 
register 250. The result of applying all logical zeros to the 
inputs of the register store 250 is to provide on output lines 
280!-280„ outputs comprising all logical zeros. The store 
circuitry 300 receives the logical zeros and provides an 65 
output of logical zero at the first output 301 and an output of 
logical zero at the second output 302. 
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In the present example, the first instruction word 101 
contains no branches and thus no target address is output at 
output 302. Accordingly execution proceeds with the fetch- 
ing of the second word 102. 

Accordingly the next instruction word to be fetched is the 
second instruction word 102. For the second instruction 
word 102, the memory circuitry stores a logical 1 at the 
address corresponding to instruction 5 at byte position 2 the 
byte positions in each word being designated 1 to 4) of that 
instruction word and a logical 1 at the address corresponding 
to instruction 7, at byte position 4 of that instruction word. 
Logical one is output from the memory circuitry over the 
output line 220 1 -220„ corresponding to the second position 
and over the output line 220!-220„ corresponding to the 
fourth position in the second instruction word 102. These 
logical one inputs are provided to the register 250 which in 
turn provides logical one outputs over those register output 
lines 280 1 -280 n which correspond to branch instructions 
which are predicted as to be taken. Thus, the one of the 
register output lines 280 a -280 n corresponding to the second 
position of the second instruction word 102 will carry a 
logical one if it is predicted that the conditional jump 
instruction 5 will be taken and the line corresponding to the 
fourth instruction position of second instruction word 102 
will carry a logical one if it is predicted that the conditional 
jump instruction 7 is effected. 

These logical one inputs are provided to the store circuitry 
300 so as to read the target addresses of the predicted-taken 
branches and the decision circuitry outputs the target address 
of the first occurring predicted -taken branch instruction. 
This is because if, for example, conditional jump instruction 
5 were predicted as taken, instructions 6 and 7 cannot be 
executed if the prediction is correct. Thus the earliest 
occurring predicted-taken branch in an instruction word 
determines the next instruction to be fetched. 

The target address is used by the processor of the 
computer, to cause a new instruction word to be fetched 
containing the instruction at which execution is predicted to 
proceed. 

In this simplified embodiment as mentioned previously, 
where execution of the instructions in an instruction word 
starts other than at the first instruction of that word, the 
address of the initial instruction is input over inputs 201 and 
the address decoder 210 only applies addresses correspond- 
ing to the remaining instructions of the word to the memory 
array 205. Thus, for example described with respect to FIG. 
1, if execution of the fourth instruction word 104 were to 
commence at instruction number 13, (for example in 
response to the conditional jump at instruction 9), the 
address corresponding to instruction 13 would be input over 
input lines 201, and address circuitry 210 would then apply 
the addresses of instructions 13, 14 and 15 to the memory 
array 205. As a result, the branch instruction at instruction 12 
(which is an unconditional jump, and is always therefore 
predicted as taken) would not be presented on one of the 
memory array output lines 220 1 -220 n . 

The above description is intended to illustrate the con- 
cepts of this invention. One embodiment of circuitry for 
carrying out the method of the present invention, this 
circuitry including a branch target buffer, will now be 
described with reference to FIGS. 3-9. 

Referring firstly to FIG. 3, a branch target buffer 400 
consists of four generally similar sections 401-404, each 
referred to hereinafter as a partition. 

The number of partitions of the present embodiment 
corresponds to the number of instructions per instruction 
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word, and in the example described above with reference to 
FIG. 1, this number is 4. It will of course be understood that 
other numbers could be used according to the content of the 
instruction word; specifically in a more complex system 8 
partitions could be used. It will also be appreciated by one 
skilled in the art that fewer partitions could be provided. For 
example, it would be possible for the four-byte example of 
FIG. 1 to only provide two partitions, one corresponding to 
the first two bytes of each instruction word, and the other 
corresponding to the second two bytes of each instruction 
word. 

In the present example first partition 401 stores data 
relevant to branch instructions which are located at the first 
byte of a word stored in the buffer, second partition 402 
stores data relevant to the branch instructions at second byte 
locations and so on. First partition 401 is referred to herein 
as the lowest partition, and fourth partition 404 as the 
highest partition. 

Each partition of the buffer 400 includes a CAM array 
holding a plurality of word addresses and associated data 
RAMs holding target addresses for branch instructions 
located at the word addresses stored in the CAMs. The buffer 
has first and second input buses 500,501 giving the address 
of an instruction being fetched. The first input bus 500 is an 
instruction word address bus receiving the most significant 
bits of the instruction address and the second input bus 501 
is an instruction byte address bus for the least significant bits 
of the address of the instruction being fetched. In the 
example of FIG. 1, the first three bits of the instruction 
address form the instruction word address and the lowest 
two bits form the instruction byte address. An input on bus 
500 therefore enables an associate operation to determine if 
the buffer holds data corresponding to the word address of 
the instruction being fetched. The input 501 is used to 
control the output of the CAM arrays during a read operation 
so as to form a masking operation which prevents output 
from any partition CAM corresponding to a byte location 
within the word before the byte position input on line 501 
(that is when the input on 501 is greater than the partition 
number). If a read operation in the buffer 400 has a hit for 
the relevant instruction identified by the inputs 500 and 501, 
the buffer provides a target address output on bus 507, an 
output 505, being a bus carrying information indicating 
which partitions contain branch instructions from the current 
instruction word which are predicted taken, and an output 
504 carrying information on the "prediction strength" 
obtained from each partition. 

It will be understood that when a processor outputs an 
address to memory to fetch a new instruction, that address 
is simultaneously fed to the buffer 400 to carry out a read 
operation and if the processor executes a branch instruction 
which ■ is not in the buffer data relating to that branch 
instruction will be written into the buffer after execution by 
the processor so that it is ready for use at some later time. 
To write data into the buffer a word address input bus 900 
is provided to write word addresses into the CAM arrays, a 
target address input 908 is provided to write into the data 
RAMs target addresses for the branch instructions which are 
newly written into the buffer. As will be later described, any 
write operation into the buffer is carried out simultaneously 
with a read operation using the same word address and input 
900 as a read input 500 together with the appropriate byte 
address 501 to avoid writing into the buffer any instruction 
which is already held in the buffer. Other inputs and outputs 
are shown in FIG. 3 and their function will be described with 
reference to the more detailed drawings of FIGS. 4, 8 and 9. 

As will be seen with reference to FIG. 3, each of the 
partitions 401-404 is substantially similar and therefore a 
detailed description of one exemplary partition 401 only will 
be given. 
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Referring to FIG. 4, partition 401 consists of n content 
addressable memory cells 510^510,, coupled to the instruc- 
tion word address input bus 500, having a first plurality of 
CAM output lines Sllj-511,,. As will be later described 

5 herein, the present embodiment of the branch target buffer is 
operated dynamically, in the sense that once all of the lines 
except one contain data the partition is regarded as "full". 
The one unfilled line is retained for writing a new entry, and 
as part of the write operation, details of one branch instruc- 

10 tion stored in the branch target buffer are discarded so as to 
be ready for input of a next newly-found branch instruction. 
A branch instruction is said to be "newly-found" if it does 
not exist in the branch target buffer at the time of testing for 
presence of the branch in the buffer. It will be noted that a 

15 branch instruction which was discarded from the branch 
target buffer in one cycle of operation may become a 
"newly-found" instruction during a later cycle of operation. 
Thus, whereas increasing the number n of content address- 
able memory cells increases the complexity and size of the 

20 device, such an increase tends to reduce the number of 
occasions on which a branch instruction needs to be written 
in. 

Decreasing the number n of memory cells provides a 
smaller and simpler device, but with the penalty that the 

25 chances of failing to find a branch instruction are increased. 
Where a jump instruction is not found this tends to lead to 
a processing delay. 

In the presently described embodiment, content address- 
able memory cells SIO^IO,, are capable of storing the 

30 word addresses input over the instruction word address bus 
500. As will be described later herein, certain instruction 
word addresses are stored in the content addressable 
memory cells. The instruction word address of the instruc- 
tion currently being fetched is input over the instruction 

35 word address bus 500 and when an instruction word address 
corresponding to one of the stored addresses is input a 
logical 1 occurs on the corresponding one of the CAM 
output tines 511 1 -511„. 
Selection circuitry 512 has an input connected to the 

40 instruction byte address bus 501 and is connected to the 
CAM output lines 511 1 -5U„ so as to selectively disable all 
of the lines of a partition. The selection circuitry 512 has a 
processing circuit 560 receiving one input from the instruc- 
tion byte address bus 501 and having an output 561 con- 

45 nected to the control inputs of a plurality of pull down 
transistors 562 1 -562 n . Each of the pull down transistors is 
connected between a respective one of the CAM output lines 
Slli-51^ and earth. During reading, a control input 912 
sets the processing circuitry 560 to produce a high output so 

50 as to turn on all of the pull down transistors 562 1 -562 n in 
response to the byte address input over the instruction byte 
address bus 501 which is greater than the number of the 
partition. In mis way the circuits 512-812 act to mask out 
these partitions where the stored branch instruction has a 

55 byte location before the byte position indicated on line 501. 
The selection circuitry 512 has output lines 513^5 13„ 
which form inputs to prediction processing circuitry 514. 
The prediction processing circuitry 514 includes an 
n-location store 563 for storing prediction information for 

60 indicating whether an associated branch instruction is pre- 
dicted taken or not taken. Each of the store locations 
563 1 -563 rt corresponds to a respective CAM cell, and has an 
output connected to one input of a respective two input AND 
gate 564j-564 M , the other input of which is provided by a 

65 respective output line 513 1 -513 M of the selection circuitry 
512. As shown in FIG. 8, an OR gate 565 is connected 
between the respective store locations 563 and the respective 
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AND gate 564 1 -564„, but as this forms a direct connection 
during reading it is omitted from FIG. 4 for clarity. As will 
later be described herein, the prediction store 563 stores a 
logical 1 in positions corresponding to branch instructions 
which are predicted as taken, and a logical 0 in positions 
corresponding to branch instructions which are predicted as 
not taken. A line 907, described more fully with reference to 
FIG. 8, allows the writing of new prediction information to 
the prediction store. Each AND gate 564 1 -564 n has a 
respective output which is connected to a respective output 
line 515 1 -515 n of the prediction processing circuitry 514. As 
shown in FIG. 8, there is an OR gate 905 between the output 
of each AND gate 564 1 -564„ and the output line 515J-515,,, 
but as this forms a direct connection during reading, it is 
omitted from FIG. 4 for clarity. The output lines 515 1 -515„ 
form the word lines to a RAM 518 and also form inputs to 
an n-input NOR gate 516. The NOR gate 516 is connected 
to a single output line 517 which controls a first transmission 
gate 531 and a second set of transmission gates 530. The 
NOR gate output lines 517-817 from all of the partitions 
together form the output bus 505. As shown in FIG. 8 the 
output of NOR gate 516 is connected to OR gate 590 which 
is omitted for clarity in FIG. 4 as it has no effect during a 
read operation as the signal on line 920 is 0 during a read. 
During a write, the gate 590 in each partition receives a 
signal 1 on line 920. 

Data RAM 518 consists of n rows of storage cells, each 
row of which is addressed by a respective one of the lines 
515 1 -515 n . Each row of storage cells stores the target 
address of a branch instruction i.e. the address at which 
execution will continue if the branch is taken. This infor- 
mation is made available on target address output bus 503, 
which passes between the partitions, but which contains the 
transmission gates 530 for isolating partitions storing data 
relevant to later bytes in the same word as will be later 
described. Bus 503 is connected to sense amplifier circuitry 
506 having a target address output 507 and a target address 
input 903 used during a write operation. The bus 503 forms 
part of a common data path interconnecting the data RAMs 
with respective bit lines of the data RAMs connected serially 
and with transmission gates in the bit line paths between 
adjacent data RAMs. 

The branch target buffer also consists of prediction- 
strength processing circuitry 550. 

The prediction-strength processing circuitry 550 receives, 
as first inputs, the output lines 513J-513,, of the selection 
circuitry 512 and produces an output on prediction-strength 
output line 504. The prediction strength processing circuitry 
550 further receives as a control input the update strength 
line 502, and also receives inputs from a line 909 for writing 
new prediction strength data. This latter is described with 
reference to FIG. 8. 

Referring again to FIG. 3, and as mentioned above, the 
remaining partitions have similar structures to that of the 
first partition 401. The integers of each partition which are 
similar to those of partition 401 have similar reference 
numerals, the reference numerals of partition 402 being in 
the range 600-699, those of partition 403 being in the range 
70G-799 and those of partition 404 being in the range 
800-899. 

As previously described, between each respective parti- 
tion and the adjacent partition there is a first transmission 
gate 531, 631, 731, 831 and a second set of transmission 
gates 530, 630, 730, 830. Both the first transmission gate and 
the second set of transmission gates are controlled by the 
output lines 517, 617, 717, 817 of the respective NOR gate 
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516, 616, 716, 816 of the associated partition. In any one 
partition, the operation is such that when the respective NOR 
gate 516, 616, 716, 816 receives a logical 1 at any one of its 
inputs, the corresponding output line 517, 617, 717, 817 

5 goes to logical 0, thus rendering the respective first trans- 
mission gates 531, 631, 731, 831 and the respective second 
set of transmission gates 530, 630, 730, 830 non -conductive. 
This has the effect of interrupting the target address output 
bus between the relevant partition and the next higher 

0 partition. The result is that only the lowest partition in which 
there is a logical 1 for the input of the NOR gate 516, 616, 
716, 816, can supply a target address onto the target address 
output bus 503. The output RAMs 518-318 are therefore 
connected serially to a common path to the output 503 and 

15 the control gates 530-830 in that path allocate a decreasing 
priority to partitions progressively moving away from the 
output 503. Any one partition can only provide an output if 
no higher priority partition is providing an output. Once one 
partition provides an output all lower priority partitions are 

2Q blocked and cannot provide an output. 

The operation of the circuitry of FIG. 3 will now be 
described: 

Data relating to branch instructions are stored in a parti- 
tion which corresponds to the byte position of the branch 

25 instruction in its instruction word. For example, data relating 
to a branch instruction in the first byte position of any word 
is stored in first partition 401, data relating to a branch 
instruction in the second byte position in the second partition 
402 and so on. Hence for the example of FIG. 1, partition 

30 402 is relevant to instructions 0, 4, 8, 12 and 16 (the first byte 
address of each instruction word), the second partition 402 
stores and processes information relevant to instructions 1, 
5, 9, 13, 17 and so on. Data indicating the presence of branch 
instructions is stored in the CAM cells of the partition, data 

35 relating to the prediction as to whether the branch is taken 
is stored in the prediction processing circuitry and data 
relating to the target address of the branch instruction in the 
RAM of the partition. In the presently described example, 
the data which indicates the presence of branch instruction 

40 is the full instruction word address of the word containing 
the branch instruction. As will be later described herein, is 
possible to store only a part of the instruction word address. 
The data chosen for indicating the presence of branch 
instructions is stored in the content addressable memory 

45 cells of the respective partition. 

The location of data identifying branch instructions is 
shown in FIG. 5 for the FIG. 1 example. Referring to FIG. 
5, it will be seen that cells 510 of the first partition 401 
contain an entry of "011" because the instruction word 104 

50 (having address "011") has an unconditional jump instruc- 
tion in the first byte position. Similarly the second and third 
instruction words 102,103 (word addresses "001" and 
"010") have conditional jump instructions at the second byte 
position of the instruction words, namely the positions 

55 corresponding to instructions 5 and 9. Thus, the cells 610 of 
second partition 402 store word addresses "001" and "010" 
of words 102 and 103. Entries are similarly made in the third 
partition (for address "011" corresponding to instruction 14) 
and in the fourth partition (corresponding to instruction 7). 

60 Still referring to FIG. 5 in the context of the instructions 
shown in FIG. 1, an input is made to the instruction word 
address bus 500 of the word address of the instruction 
currently being fetched; this address is applied by the bus to 
the CAM cells 510, 610, 710, 810 of the whole device. 

65 Where a match occurs between the word addresses input to 
the bus 500 and an address stored in a CAM cell, a logical 
one output (referred to as a "CAM hit") is provided by the 
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relevant output line of the CAM. If for example the word 
address "000", corresponding to word 101 is input, no 
matches occur in any partition and no CAM hits will occur. 
However, when the word address "001" of the second 
instruction word 102 is input over bus 500, a match occurs 5 
in the CAM cells 610 of the second partition (corresponding 
to instruction 5 — see FIG. 1) and in the CAM cells 810 of 
the fourth partition (for instruction 7 — see FIG. 1). The 
result of these matches is to produce CAM hits on the CAM 
output lines 611, 811 associated with the CAM cells where 10 
the hit occurred. 

An input is made to the byte input address bus 501 
comprising the instruction byte address, which represents 
the position within the instruction word at which execution 
is to commence. For the example shown in FIG. 5, assume 15 
that execution of the program segment shown is to start at 
the third instruction of second instruction word 102 
(instruction 6 in FIG. 1). In this situation, the word address 
input over bus 500 would be "001" corresponding to the 
second instruction word 102. This would produce CAM hits 20 
in the second and fourth partitions. Referring again to FIG. 
1, it will be seen that the byte address for the third instruction 
of each instruction word is "10". It is this address which is 
input on instruction byte address bus 501. 

The selection circuitry 512, 612, 712, 812 includes 25 
respective processing circuitry 560 (see FIG. 4) which acts 
during reading to disable any partitions which correspond to 
bytes lower than the byte address input on instruction byte 
address input bus 501. Thus, for an input byte address of 
"00" no partitions will be disabled, and all selection circuits 30 
will pass any hits from input 511,611 etc to output 513,613 
etc. If the instruction byte address is "01" then selection 
circuitry 512 of the first partition 401 will not pass any hit 
from an input line Sllj-Sll,, to an output line 513J-513,,, 
whereas in other partitions, any hit will be passed from input 35 
to output. If the instruction byte address were "10" then any 
hit would only be passed by selection circuits 712 and 812 
in the third and fourth partitions, and so on. In the present 
case, the instruction byte address is "10", thus enabling only 
selection circuitry 712 and 812 to pass any hit. However, the 40 
only hit which occurs is in the fourth partition and this is 
therefore allowed to proceed as an input to the prediction 
processing circuitry 814 of the fourth partition. 

It will be recalled that the prediction processing circuitry 45 
514,614,714 and 814 each consists of respective prediction 
store 563 which contains a logical 1 or logical 0 indicating 
whether a respective branch is predicted as being taken or 
not, and plural AND gates 564. The AND gates receive one 
input from the associated selector circuitry and one from a 5Q 
respective entry of the register. Thus, considering instruction 
No 7 of FIG. 1, the prediction processing circuitry 814 of the 
fourth partition will either produce a logical 1 from a 
respective AND gate on one of the output lines 815 (if the 
branch instruction 7 is predicted as taken) or will produce all 55 
logical 0's on the output lines 815 (if the branch instruction 
7 is predicted as not taken). 

For the first situation, namely that where the branch 
instruction is predicted as taken, the presence of a logical 1 
on the output line 815 which corresponds to the position of 60 
the branch instruction information stored in the content 
addressable memory 810 causes: 

1. The output of NOR gate 816 to go to logical 0 and; 

2. The corresponding row of the data RAM 818 to be 
addressed, thus outputting onto the target address out- 65 
put bus 503. The target address "00000", i.e. the target 
address of the branch instruction. 
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It will be noted that in the remaining, i.e. first-third, 
partitions 401-403, there will be no logical l's at the output 
of the corresponding prediction processing circuitry 514, 
614, 714. As a result, NOR gates 516, 616, 716 will have a 
logical 1 output which ensures that the corresponding trans- 
mission gates 530,630 etc; 531,631 etc are conductive and 
that the corresponding lines of the predicted-taken bus 505 
will be at logical 1, save that line which is derived from the 
fourth partition 404. This allows the output from partition 
404 to be output on lines 503 and 504. 

For a second example, assume that fourth instruction 
word 104 in the fetched sequence is the first instruction 
(instruction 12) to be executed. 

Now, reference to FIG. 1 shows that instruction 12 
(having address 01100) is a non-conditional jump. 
Accordingly, every time execution proceeds to this 
instruction, the jump is taken and provided instruction 12 is 
stored in the branch target buffer, the branch target buffer 
circuitry will provide a logical 1 on one output line 
515 1 -515 n of prediction processing store circuitry 514 in the 
first partition 401. It will be appreciated that the fact of 
taking branch instruction 12 means that execution will not 
directly proceed to instructions 13, 14 or 15. As a result, the 
predicted outcome of branch instruction 14 is irrelevant 
because that instruction will not be executed in sequence 
with instruction 12. The branch target buffer takes this into 
account because the NOR gate 516 of the first partition 
provides a logical 0 output over its output line 517 to render 
non-conductive the transmission gates 530, 531, thus iso- 
lating the second-fourth partitions 402, 403, 404. The sec- 
ond set of transmission gates 530 prevents the output of 
target data from the data RAMs of the other partitions and, 
as will later be described herein, the first transmission gate 
531 prevents the prediction strength of predicted unexecuted 
branch instructions from being updated. Although this pri- 
oritising feature has been described for the unconditional 
branch instruction 12, it should be noted that the branch 
target buffer is not aware that 12 is different to any other 
predicted-taken branch. The buffer will therefore treat any 
other predicted- taken branch in the same way, i.e. act to 
exclude any output for later branches in the same word. 

Consider a further example in which execution of instruc- 
tion word 102 is to commence from the first instruction of 
that word (instruction 4 of FIG. 1, having address "00100"). 
Assume for the purpose of this example that instruction 5 
(00101) is not predicted as taken and that instruction 7 
(00111) is predicted taken. The instruction word address is 
input over instruction word address input bus 500, which 
will produce logical 1 CAM "hits" in the second and fourth 
partitions 402, 404. The byte address "00" is input over 
instruction byte address input bus 501 to the selection 
circuitry 512, 612, 712, 812 of the partitions. As mentioned 
above, the "00" address causes each of the selection circuitry 
to pass hits from input to output, to provide corresponding 
logical l's to the prediction processing circuitry. In the 
presently described example, such logical l's occur in the 
second and fourth partitions 402 and 404. In the second 
partition 402, the relevant register of the prediction process- 
ing circuitry stores a "not-taken" prediction, and as a result 
all of the outputs of the prediction processing circuitry 614 
remain logical 0, indicating no branch predicted taken. By 
contrast, the prediction processing circuitry 814 stores a 
"taken" prediction in the position corresponding to word 102 
and the application of a logical 1 "hit" provides a jump taken 
output on one of the lines 815. This: 

a) Provides a logical 0 at the output of the associated NOR 
gate 816; and 
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b) causes the data RAM 818 of the fourth partition to 
output a target address of "00000". As none of the 
"lower" partitions has a non-conductive transmission 
gate, this target address is passed through to the target 
address output bus 503. 

As a result it can be seen that the branch target buffer 
described above identifies only the first predicted-taken 
branch instruction of a sequence, which is not excluded for 
execution by being prior to the first instruction of the 
sequence to be executed, and, more specifically, the target 
address of that instruction. 

As previously discussed with respect to FIG. 4, each 
partition of the branch target buffer includes circuitry 550 
known herein as prediction strength processing circuitry for 
storing information based on the history of the branch 
instructions identified in the corresponding partition. As 
previously noted, FIG. 4 represents an exemplary partition 
401 and the prediction strength processing circuitry 550 has 
counterparts 650, 750, 850 in the other partitions. 

Prediction strength processing circuitry 550 receives 
inputs from each of the selection circuitry output lines 
513 1 -513 rt . A logical 1 will occur on one of those output 
lines 513 a ~513„ when a match occurs between the word 
address input on bus 500 and data indicating the presence of 
a branch instruction stored in one of the CAM cells 
SlOi-510^ provided the selector circuitry 512 has not 
disabled the partition because the instruction in that partition 
is prior to a first executed instruction of the relevant 
sequence. Each of the inputs provides a first input to a 
respective AND gate 570 1 -570„. The other input to each 
AND gate is derived from a respective entry 571 1 -571 7l of 
a store 571 in the prediction strength processing circuitry 
which stores information indicative of whether there is 
associated with the corresponding instruction a so-called 
"weak" prediction or a so-called "strong" prediction. In the 
present embodiment, a weak prediction is represented by a 
logical 1 stored in the corresponding stage, and a strong 
prediction is a logical 0 stored in the corresponding stage. 

A strong prediction indicates that a high degree of con- 
fidence that the presently stored prediction is correct, and 
therefore unlikely to be changed whereas a weak prediction 
indicates a lower degree of confidence in the correctness of 
the present prediction, and a greater likelihood of change. 

The lines 513 3 -513 n also form one input to respective 
AND gates 572 1 -572„. The other inputs to the AND gates 
572!-572 n are provided by the update enable line 502, 
which is common to all those gates. The output of each AND 
gate 572 a -572 n controls a respective pull down transistor 
573 1 -573 rt . The pull down transistors 573 1 -573„ are con- 
nected to a corresponding entry 571^571^ of the prediction 
strength store 571. The output of the AND gates 570 1 -570 n 
form inputs to an n-input OR gate 574 having output 504 
which includes an output prediction strength line for each 
partition. 

In operation, when a logical 1 appears on one of the input 
lines SLSj-S^ (indicating that a branch instruction has 
been "hit" in the partition) that logical 1 is applied to one of 
the AND gates 570^570^. The AND gate will produce 
logical 1 output only if the corresponding stage of the 
prediction strength store 571 stores a logical 1, correspond- 
ing to a "weak" prediction. If this occurs, then one of the 
inputs to the OR gate 574 will be at logical 1 and the output 
line 504 has a logical 1 indicating the prediction is weak. 
The logical 1 on the one of the input lines 513 1 -513 n is also 
applied to one input of a respective one of the AND gates 
572,-572* 

The output of the AND gate 572 a -572„ will be at logical 
0 unless the update enable line 502 is at logical 1 which 
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causes an automatic update of prediction strength to an 
interim new value. In this event, one of the AND gates 
572 1 -572„ will have a logical 1 output, which causes the 
associated one of the pull down transistors 573 1 -573 n to turn 
on, pulling the corresponding stage 571 1 -571 n of the pre- 
diction store 571 to be pulled to logical 0, thus causing the 
prediction value stored in that stage to change to logical 0 
indicating a strong prediction or remain at logical 0 if it was 
already in that state. 

As previously noted, there is a respective transmission 
gate 531,631 etc connected in the update enable line 502 
between each partition and the next higher partition. It will 
be recalled that this transmission gate is conductive unless 
the relevant partition has identified that a branch instruction 
is predicted taken. In that event, the transmission gate is 
rendered nonconductive during the operating cycle by the 
respective line 517, 617, 717, 817 going to logical 0. At the 
start of each cycle, the transmission gates are conductive, 
and a logical zero is applied via the line 502 to all partitions 
as a precharge level. The logical zero is then disconnected 
but the line remains at that level. Once transmission gates 
531, 631 have gone non -conductive in partitions where a 
branch is predicted taken, a logical one is applied to the line 
502. The consequence of this is that a logical 1 on the update 
enable line is input to each partition in ascending order up 
to and including any partition in which a branch instruction 
is predicted as taken. Later partitions, regardless of whether 
or not they contain predicted-taken branch instructions do 
not receive the logical one level needed to update the 
prediction and thus are not automatically updated. 

If during a fetch operation an instruction word is recog- 
nised as having plural branch instructions, it is desirable that 
the above-discussed automatic updating take place for all 
those branch instructions which are not excluded from 
execution by virtue only of the initial execution point within 
the word, up to and including the first predicted -taken 
branch. As an example, if all of the partitions 401-404 stored 
a jump instruction for a particular instruction word and if 
execution were to start from the branch instruction in 
partition 402 (predicted not-taken) and the instruction in 
partition 403 were predicted taken, then no update should be 
performed on the predictions stored in partition 401 
(because this instruction could not be executed) and no 
update should be performed upon the instruction represented 
by partition 404 (because the instruction represented by 
partition 403 is predicted as taken, thus preempting any 
judgements on the instruction represented by partition 404). 

This update strategy is provided for in the branch target 
buffer in the following way: 

Since in this example the byte address input over byte 
address input bus 501 causes the selection circuitry 512 to 
disable the first partition 401, there will be no logical is on 
any of the lines 513. Thus no logical ones will be applied to 
the prediction strength circuitry 550 and the output of the 
NOR gate 516 remains at logical 1. Gates 530,531 remain 
conductive. Although there will be a CAM hit in the second 
partition 402, and the selection circuitry 612 allows the 
corresponding logical 1 to be applied to the prediction 
processing circuitry 614, the 1 not-taken' prediction has the 
consequence of providing all logical 0s at the lines 615. Thus 
NOR gate 616 has a logical 1 output and gates 630,631 are 
conductive. The appearance of a predicted-taken result in the 
third partition 403 causes the NOR gate 716 of that partition 
to provide a logical 0 output, thus rendering non-conductive 
the transmission gate 731. Hence the update line 502 is 
connected to positions 401-403, and disconnected from 
partition 404. No updating in partition 401 occurs because a 
logical 1 is required on one of the lines 513 for this to occur. 
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The prediction strength information, as mentioned above, 
provides certain information on the history of identified 
branch instructions. 

Since the prediction of the outcome of a branch — i.e. 
whether or not a jump instruction is taken — is required at the 
time of fetching, rather than executing, the branch 
instruction, it is necessary to update the prediction for use 
the next time that particular jump instruction is fetched, 
depending on whether or not the prediction currently being 
made is found to be correct or not during actual execution. 
The strategy adopted in the presently-described embodiment 
is to update automatically to an interim new prediction on 
the assumption that the present prediction is, in fact, correct. 
If the interim prediction is found to have been correct when 
the branch is resolved, then the interim prediction is retained 
as the new prediction for the next execution, referred to 
herein as the resolved prediction; only if the present predic- 
tion is found to have been incorrect is there a need for 
correction and a corrected 'resolved prediction* is stored. 
The branch target buffer defaults to a state in which no 
branch is predicted taken for any cycle in which corrections 
are being undertaken. This enables the branch target buffer 
to be implemented as a single-port device, the single port 
being alternatively used for reading out of predictions and 
writing in of predictions. 

Referring to FIGS. 6 and 7, the process of updating the 
prediction and prediction strength will be described. Both 
FIGS. 6 and 7 show the history of a branch instruction which 
is entered into the branch target buffer during an entry cycle 
(E). For this explanation, it is assumed that each time the 
branch instruction is fetched, the previous execution of the 
instruction has been resolved. In practice, it may be possible 
for the instruction to be fetched again before a previous 
execution has been completed, as will be later described 
herein. On entry, the prediction and prediction-strength for 
any newly-entered jump instruction is "weakly -taken" (wT). 
The prediction and strength are stored as a logical 1 
(indicating "taken") in the prediction store 563, 663, 763, 
863 and as logical 1 (indicating "weak") in the prediction 
strength store 591 etc, to form the prediction for that 
instruction when it is executed next time. 

In both FIGS. 6 and 7, cycle C a represents the next cycle 
in which the presently-considered instruction is fetched, and 
the resolved prediction for the previous cycle represents the 
prediction state at the time of next fetching of the 
instruction, i.e. the start of the cycle C r Similarly, C 2 
represents the second occasion at which the branch instruc- 
tion is fetched, and the resolved prediction for cycle C 1 
represents the predicted value for cycle C 2 , and so on. In 
each cycle, the prediction is updated to provide an interim 
update by assuming that the prediction at the start of the 
cycle was correct. If execution of the instruction shows that 
the prediction was in fact wrong, the interim update is 
discarded, and a new prediction made. This new prediction 
is based upon the previous prediction, at the start of the 
cycle, and the knowledge gained during resolution. 

In the presently-described embodiment, a branch instruc- 
tion is put in the BTB only when it is executed to cause a 
jump from the normal sequence. Thus at the end of the entry 
cycle, after resolution, the history of the jump instruction is 
"T" (Taken once). 

Referring to FIG. 6 the progress of a jump instruction 
which is initially correctly-predicted will be described: 

At the start of cycle C l9 the prediction is "weakly taken" 
and it is assumed by the branch target buffer that the jump 
will in fact be taken. Thus the prediction state is updated by 
the update enable line 502 and the automatic update circuitry 
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572, 573 etc to "strongly-taken". In this case, the jump is 
taken. There is thus no need to correct the interim prediction, 
which becomes the resolved prediction at the end of the 
cycle which includes fetching and executing the instruction. 

5 At the start of cycle C^, the prediction is "strongly taken", 
and the jump is resolved as "taken". The interim update 
prediction remains strongly taken and, as the jump is 
resolved as being taken, the resolved prediction is likewise 
"strongly taken". 

10 However, if in cycle Gj the prediction is incorrect, in that 
the jump is resolved as being not taken then the following 
applies: 

At a start of cycle the resolved prediction was 
"strongly taken" and the interim update is thus "strongly 
15 taken". However, as the jump is resolved as "not-taken" the 
prediction requires updating to "weakly taken" as shown. 
Thus it will be seen that the strength not the prediction is 
changed. 

For the next cycle, C 4 , the prediction value is still "taken", 

20 although "weakly-taken". Thus the interim update will be 
from "weakly-taken" to "strongly-taken", on the assumption 
that the predicted behaviour is correct. If however once 
again the jump is resolved as being not-taken, the resolved 
prediction must be corrected to "weakly-not -taken", in other 

25 words the correction between the resolved prediction of 
cycle C 3 to the resolved prediction of cycle C 4 requires the 
prediction to be changed, rather than the strength to be 
changed i.e. from "weakly-taken" to "weakly-not-taken". 
This change is made by an associative look-up in the present 

30 embodiment. 

Finally at the start of cycle C 5 the prediction value is 
"weakly not taken", and accordingly the "interim update" is 
to "strongly not taken". If the jump is resolved as "not taken" 
the resolved prediction is "strongly not taken" whereas if the 

35 jump is resolved as taken, then the resolved prediction 
would be "weakly taken". 

Turning to FIG. 7, the progress of a jump instruction is 
shown, in which the first resolution of the jump instruction 
after entry into the branch target buffer is incorrect. Thus, in 

40 cycle Cj the interim update, assuming that the prediction 
value of "weakly taken" is correct, is to "strongly-taken". 
However, the jump is resolved as "not-taken". As a result, 
the resolved prediction is "weakly-not-taken", in other 
words requiring the prediction at the start of the cycle, which 

45 was "weakly taken" to be corrected to "weakly not taken". 
At the start of cycle Q the "weakly not taken" prediction 
gives an interim update of "strongly not taken", and the 
resolution of the jump as "not taken" confirms this value as 
the resolved prediction. However, in cycle C 3 , although the 

50 interim update is to remain at "strongly not taken", the 
resolution is "taken" thus the resolved prediction at the end 
of cycle C 3 is "weakly not taken", i.e. once again changing 
only the strength of the prediction, rather than the prediction 
itself. This should be contrasted with cycle C4, in which the 

55 predicted value of "weakly not taken", at the start of the 
cycle is updated to "strongly not taken", but as a result of a 
resolution to "taken" is finally corrected to "weakly taken", 
in other words a change of prediction value, not of predic- 
tion strength. 

60 The above discussion of the branch target buffer generally 
relates to the circuitry in operation of the buffer in the 
reading mode. Details of the circuitry for writing informa- 
tion to an exemplary line of the branch target buffer will now 
be described with reference to FIG. 8. FIG. 8 shows an 

65 exemplary content addressable memory cell 510 fc , being one 
of the memory cells 510 in the first partition 401. This 
content addressable memory cell 510 fc for the purpose of 



